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The spatial distribution of people exhibits clustering across a wide range of scales, from household 
(~ 10 -2 km) to continental (~ 10 4 km) scales. Empirical data indicates simple power-law scalings 
for the size distribution of cities (known as Zipf’s law) and the population density fluctuations as a 
function of scale. Using techniques from random field theory and statistical physics, we show that 
these power laws are fundamentally a consequence of the scale-free spatial clustering of human popu¬ 
lations and the fact that humans inhabit a two-dimensional surface. In this sense, the symmetries of 
scale invariance in two spatial dimensions are intimately connected to urban sociology. We test our 
theory by empirically measuring the power spectrum of population density fluctuations and show 
that the logarithmic slope a = 2.04 ± 0.09, in excellent agreement with our theoretical prediction 
a = 2. The model enables the analytic computation of many new predictions by importing the 
mathematical formalism of random fields. 


I. INTRODUCTION 

Human populations exhibit remarkably simple proper¬ 
ties given the complexity of socioeconomic interactions 
between humans and their environments [1]. One such 
example is the well known Zipf’s law$2_. for cities: the 
rank of a city is inversely proportional to the number of 
people who live in the city. If the most populous city in 
the United States has a population of Af maXjU s ~8x 10 6 , 
the second most populous city will have a population of 
|Af maXiUS ~ 4 x 10 6 , the third |(V max ,us ~ 2.7 x 10 6 , 
and so forth. This simple relation fits empirical data 
extremely well(3j 0|. A mathematically equivalent for¬ 
mulation of Zipf’s law is that the underlying distribution 
of cities follows a power law [5]; namely, the probability 
that a city has a population N scales as 1/N 2 . 

The remarkable simplicity and empirical success of 
Zipf’s law have attracted significant theoretical attention 
and debatejH [6, ;7], though there is no consensus on the 
origin of Zipf’s law. Existing work treats cities as the 
fundamental entities of the theory, with population as a 
property of each city. For example, Gibrat’s law applied 
to cities |8i Si, which states that the fractional growth 
rate of a city is independent of its population, will drive 
the distribution of city populations to a log-normal dis¬ 
tribution. The tail of the log-normal distribution then 
gives rise Zipf’s law. 

Our approach is conceptually different: we treat the 
population density as the fundamental quantity, thinking 
of cities as objects that form when the population den¬ 
sity exceeds a critical threshold. The situation is there¬ 
fore conceptually and mathematically analogous to the 
formation of galaxies in the universe, where non-linear 
gravitational collapse occurs when the matter density ex¬ 
ceed some critical value. Our conceptual advance here is 
also a practical one, since we can apply the mathemati¬ 
cal tools developed for analyzing random fields Q2] to the 
problem at hand. 

Before proceeding with a technical derivation of our re¬ 


sults, let us briefly summarize them. The starting point 
is to model human population density as a random func¬ 
tion of spatial position. A function of spatial position is 
a field , and thus human population density will be mod¬ 
eled as a random field (for a review of relevant topics in 
random fields, see mm and especially USD- To lowest 
order, a single random variable in elementary statistics is 
characterized by a mean and a variance. A random field 
may be regarded as a higher dimensional generalization 
of a single random variable. By analogy, a random field 
is characterized by a mean and a power spectrum , which 
can be thought of as a generalization of variance. The 
power spectrum gives the amount of fluctuations of the 
field as a function of scale. To derive the form of the 
power spectrum for human population density, we in¬ 
voke scale-invariant random growth, similar in spirit to 
Gibrat’s law. 

We move on to derive Zipf’s law. Our derivations 
involve the simple assumption that some cities emerge 
above some critical population density threshold. To 
count the number of cities in our model, one must an¬ 
swer the following mathematical question: given a ran¬ 
dom field characterized by a power spectrum, how often 
does the random field take on values greater than a cer¬ 
tain threshold? This is a frequently asked question in 
the context of cosmology, and the Press-Schechter (PS) 
formalism allows us to analytically compute the answer. 

We demonstrate that our derivation of Zipf’s law is 
more general than the motivating random growth model; 
we argue that the only key ingredient is scale invariance 
in two spatial dimensions. In other words, whereas previ¬ 
ous work tends to focus on how Zipf’s law emerges from 
concrete models, we argue that Zipf’s law naturally oc¬ 
curs in a very large class of statistical models. In the lan¬ 
guage of statistical physics [12]. the existence of Zipf’s law 
is only a function of the universality class of the statisti¬ 
cal model; it is independent of the “microscopic” details 
of the system’s dynamics which are undoubtedly complex 
in the case of human populations. 
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II. DERIVATION OF ZIPF’S LAW 

We now proceed with the detailed derivation. To start, 
consider the human population density p as a function 
on ]R 2 , the 2D Euclidean plane. Since we will be inter¬ 
ested in regions much smaller in size than the radius of 
the Earth, we will ignore the effects of curvature. The 
fluctuations relative to the average population density 
<5(x) = \{pfx)/p) — 1] can be expanded in Fourier modes 

Up to a conventional normalization factor of 27r, this 
equation simply rewrites the population fluctuations as a 
sum of plane waves e“* kx , each weighted by a factor 
Since the left hand side is a random variable, the right 
hand side must also be a random variable; since every 
term except for dt on the right hand side is manifestly 
deterministic, <5 k must be a continuum of random vari¬ 
ables, with one random variable for each wave vector k. 
Just as an ordinary random variable is characterized by 
a variance, each is characterized by a number P(k) 
called the power spectrum, which is defined as 


over-densities can grow or shrink spatially, executing a 
random walk in A'. This process can continue until the 
overdensity disappears (A = 0), or the over-density takes 
up some maximum X max , where X max = A'(/c m ; n ) is set 
by the continental length scale ~ l/fc m in- For a large 
ensemble of over-densities, this is a diffusion-like process 
with reflecting boundary conditions obeying 


dA _ d 2 A 
1ft ~ D dX 2 


( 2 ) 


with some diffusion constant D. We are only interested 
in the late-time behavior of equation (2). Any initial 
conditions will relax to the steady-state solution A (A) —► 
constant for 0 < X < X max on a timescale T re i ax ~ 
Xmax/D. We intuitively expect T re i ax to be reasonably 
short, since the geographic mobility timescale of ~ 5 yrs 
(in the United States, ~ 35% of people change residences 
within 5 years IT5] ) is considerably shorter than, say, the 
population growth timescale ~ 30 yrs set by the typical 
age of parenthood. Any initial conditions set by antiquity 
or perturbations to the system (e.g. catastrophic events 
that displace many people) should be quickly erased. We 
therefore predict that on sufficiently long timescales, 


(6 k 6* k ,) = (2n) 2 6 2 D (k-k')P(k), (1) 


P(k) oc k 2 . 


( 3 ) 


where Sd is the Dirac delta function (not to be confused 
with the fractional over-density <5(x). By assuming rota¬ 
tional symmetry, the power spectrum becomes a function 
only of magnitude P(k) = P{k). Equation (1) makes pre¬ 
cise the statement that the power spectrum P(k) quanti¬ 
fies the amount of statistical fluctuations associated with 
a given frequency k. 

It is conventional to define a dimensionless power spec¬ 
trum in the number density A 2 (k) = k 2 P(k)/(2n), which 
represents the typical (squared) fractional over-density of 
people ( 5p/p ) 2 on the spatial scale ~ 1/k. To make fur¬ 
ther progress, we must fix the functional form of A(A;) 
by some theoretical principle. To this end, consider an 
over-density of size ~ 1/k. At a discrete time step, this 
over-density might grow or shrink in spatial coverage. As 
a concrete example, consider a collection of farms (with a 
characteristic population density of a few people per typ¬ 
ical farm area) in otherwise relatively uninhabited coun¬ 
tryside. At each time step, a farm could be added or 
destroyed. In this way, our unifying principle of random 
walkers is conceptually similar to previous work on the 
random growth of firms [14] . Therefore, the spatial size of 
the over-density might grow or shrink, while Sp/p (a num¬ 
ber associated with farms) will be held constant. More 
precisely, we define a monotonically decreasing function 
A (k) such that lim^oo X = 0, which quantifies the spa¬ 
tial extent of an over-density. This function might rep¬ 
resent the area of the over-density X(k) oc 1/fc 2 or its 
perimeter X(k) oc 1/k, but our derivation will not de¬ 
pend on the detailed form of A. We can then perform 
a change of variables and view A(k) as a function of 
A: A(A (k)) = A(k). The unifying principle is that all 


We test this prediction in Figure 1 against publicly avail¬ 
able data from the Center for International Earth Science 
Information Network (CIESIN) and Centro Internacional 
de Agricultura Tropical (CIAT)[TSj. We find the best fit 
slope P(k) oc k~° to be a = 2.04 ± 0.09, where we have 
reported the ±ler uncertainties. Our theoretical predic¬ 
tion is therefore in excellent agreement with observations 
across a broad range of spatial scales, from a few km to 
~ 10 3 km. 

Before further developing the theory, a more intuitive 
derivation of P(k) ~ fc -2 is worth mentioning. Over a 
large range of length scales our model is scale free, imply¬ 
ing P(k) ~ k~ a for some a. In d spatial dimensions, the 
left hand side of equation (1) has units of k~ 2d , the Dirac 
delta function has units of k~ d , so P(k) should have units 
of k~ d . Since there are no other dimensional parameters 
relevant to our theory (the diffusion constant has units 
of [A] 2 T _1 , but there are no other constant with units 
of time T), we must have a = d = 2 in two spatial di¬ 
mensions. In this sense, geometry and scale invariance 
uniquely determines the slope of the power spectrum. 

In fact, this simple argument demonstrates that 
P(k) oc k~ 2 is a universal feature of 2D models that have 
no parameters with units of length to some power. Effec¬ 
tive field theory, a powerful technique for studying any 
statistical physics system, can be used to further sharpen 
this statement; this is done in Appendix D. The deriva¬ 
tion in Appendix D provides perhaps the most rigorous 
way of justifying the statement that P(k) oc k~ 2 is a 
generic property of models with scale invariance in two 
dimensions, since effective field theory should capture the 
equilibrium properties of any statistical model on scales 
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FIG. 1. (Color online). Empirically measured power spec¬ 
trum P(k ) oc |<$fc| 2 ~ k~ a of population density fluctuations 
as a function of the spatial wavenumber k. The best fit slope 
a = — 2.04±0.09 (solid blue line) is virtually indistinguishable 
from the predicted slope a = —2 (dashed orange line). The 
data was obtained by taking the diagonal entries (to avoid 
anisotropy from rectangular gridding) of a discrete Fourier 
transform of a 1000 x 1000 arcmin 2 map of the population 
density of a section of the continental United States. The 
area was selected to minimize artifacts due to boundary con¬ 
ditions defined by lakes and oceans. 


smaller than the system scale but sufficiently large such 
that densities can be approximated by smooth functions. 

With a power spectrum P(k) in hand, it is possible to 
calculate the number of cities as a function of their pop¬ 
ulation N. We picture cities of area A as discrete objects 
which form when the population density as a function 
of spatial coordinates p(x), or equivalently S(x), aver¬ 
aged over an area A surpasses a critical threshold, Sc- In 
other words, we choose the surface area A such that the 
total integrated population N = f xeA p(x) d 2 x = pc* A, 
where the critical density pc = p(l + 5 C ). 

This is shown pictorially in Figure 2. [This assump¬ 
tion can be relaxed, allowing for the average population 
density of a city to vary systematically with size. In our 
model, this corresponds to a critical threshold that varies 
with A. In this case, the excursion set formalism can be 
used with a moving barrier [T?]. We will ignore this sub¬ 
tlety, since Zipf’s law is still obtained in the limit that 
Sc <C o’. Also, since the numerical value of the threshold 
is not fixed, we could also consider the case where the 
threshold varies by country. Again, Zipf’s law would be 
obtained for each country.] 

The counting of cities is now a well-posed question. 
Computationally, one could find the number distribution 
of cities with the following algorithm. Generate via a 
Monte Carlo procedure many realizations of the random 
field with mean 0 and power spectrum P(k) = Pok~ 2 . 
Find the regions where the random held exceeds a certain 
threshold. Measure the size of each region, and multiply 
the area of each region by the population density thresh¬ 
old; define this to be the population of each city. Repeat 
for many Monte Carlo iterations, and then make a his¬ 


togram of the size distributions of each region. One can 
verify numerically that the resulting number distribution 
n(N) would scale approximately like 

n(N) oc N~ 2 , (4) 

where N is the population of the city and n(N) is the 
number density of cities of size N. However, we can in 
fact show analytically that the number distribution takes 
this form using the Press-Schechter (PS) formalism [TUI, 
traditionally used in the context of cosmology to predict 
the abundance of gravitationally-bound objects given a 
power spectrum of the fluctuations in the cosmic mat¬ 
ter density. However, we emphasize that the formalism 
is in essence a purely statistical one, which does not re¬ 
quire or employ any facts from cosmology. The excur¬ 
sion set formalism [IS] provides a more rigorous deriva¬ 
tion, but the PS formalism has the benefit of simplicity. 
The end result is identical in either case. We provide a 
self-contained proof of equation (4) in Appendix A. 

By integrating equation (4) with respect to N, we 
find that the number of cities above a certain population 
threshold scales inversely with the population threshold. 
This statement is equivalent to Zipf’s law: the rank of a 
city is inversely proportional to its size. 

III. CONCLUSION 

In summary, we have presented a derivation of Zipf’s 
law and successfully predicted the power spectrum P(k) 
of population density fluctuations in the continental US. 
These derivations stemmed from two fundamental ingre¬ 
dients: scale-invariance and 2D geometry. Remarkably, 
there is a wide range of possible models and an even 
wider range of initial conditions to which our results are 
insensitive. One such model involves random walks of 
the sizes of clusters of people on all scales, which can be 
viewed as a vast generalization of Gibrat’s law. How¬ 
ever, as we have emphasized, even this generalization is 
still a relatively specific example in the class of all models 
which will lead to Zipf’s law.This shows that the origin 
of these laws is fundamentally the scale-free nature of 
clustering in human populations. This is an appealing 
feature, enabling us to forgo any fine-tuning arguments 
in explaining the empirical data. 
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Appendix A: The Press-Schechter formalism 

The Press-Schechter formalism (for a pedagogical 
overview of the PS formalism and its generalizations, see 
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FIG. 2. (Color online). Schematic illustration of our approach. On the left, a simulated population density map with a power 
spectrum P(k ) oc k ~ 2 is displayed. Darker pixels indicate higher population densities. On the right, we select and color pink 
all pixels above a certain population density threshold from the simulated map on the left. In our formalism, a city is identified 
with each pink cluster, appropriately smoothed on the length scale of the cluster. Scale invariance implies that cities of all sizes 
appear on the map, as confirmed by visual inspection. In our formalism, the statistical size distribution of pink cluster gives 
us the population distribution of cities, which agrees with Zipf’s law. 


section 3.4 of m) allows us to answer the well-posed 
question: given a random field with an associated power 
spectrum P(fc), how often does it exceed the threshold? 
More specifically, suppose there is a class of objects (e.g. 
cities) that form when the population density exceeds a 
certain threshold p(x ) > ^threshold- Furthermore, let the 
size R of each object be defined as the maximum radius R 
such that the average population density p c ircie,R within 
a circle of radius R centered on the object is given by 

Pcircle,R = ^threshold- (Al) 

It is conventional to define a smoothed density field 

Mx)= J d 2 kW A (k) <5 k e -ikx /(27r) 2 , (A2) 


assumes that pa is a Gaussian with mean 0 and vari¬ 
ance <j 2 (A) = dkkP(k)/ (2n) oc hik/k m i n . If each 

Fourier mode is statistically independent of every other 
Fourier mode, the density field will be the sum of many 
independent Fourier modes and will therefore be approx¬ 
imately Gaussian. However, for the sake of generality, 
we will not assume that 6 is normally distributed. In¬ 
stead, we only assume that pa has a universal shape for 
all A. Since the mean of 6a is zero for all A by defini¬ 
tion, and since for any random variable 9 the associated 
standard deviation obeys a a g = aag , this allows us to 
write pa{6) = g(6/a(A))/a(A) for some general proba¬ 
bility density function g. Differentiating Ja yields 71 (A), 
the number of cities on Earth’s surface with population 
A = pA per unit area per unit population: 


where the low-pass window function WA(k) = 1 if k < 
l/\fA and HGi(fc) = 0 otherwise. This smoothed field 
is simply the original field 6{x) with the high-frequency 
fluctuations subtracted out, leaving behind the slowly- 
varying components. 

For a fixed x, <5 a( x ) is a random variable with prob¬ 
ability distribution pa{6a)- The key insight of the PS 
formalism is to identify the fraction Ja of people living in 
cities of area A or larger with the cumulative probability 
/ = 2 pa d5. This is illustrated in Figure 2. To make 
further progress, we must assume something about the 
functional form of pa- The conventional PS formalism 


n(N) 


= ~vg{v) 


p diner 
A dN 


oc 


1 g{v) 

N 2 ln(A max /A) ’ 


(A3) 


where we have defined v = Sc/a(N ), the number of stan¬ 
dard deviations associated with city formation. Note that 
for vCl, g[y) is a slowly varying function of A for two 
reasons: the first derivative of g around v = 0 is small 
for small deviations from the mode, and v is only a weak 
function of N. Thus, equation (3) implies that the loga¬ 
rithmic slope g? logn/dlog?V tends to —2 in the limit of 
N <C Amax. This limit is empirically justified, since even 
the largest cities in the world contain only ~ 10 -3 of the 
world’s population. Hence we arrive at equation (4). 
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Although our results are largely independent of 
the exact form of pa(8), let us briefly comment on 
its possible form. If pa{3) deviates from a Gaussian, 
this implies that different Fourier modes in human 
population density are correlated, a generic result of 
non-linear interactions. Note, however, that S > — 1 is 
strictly bounded from below, since human population 
density is always positive-definite: p > 0. Hence, pa 
cannot be exactly Gaussian. At some level, non-linear 
interactions must come into play. If the population 
density fluctuations were typically small 5 < 1 , one 
might expect that a Gaussian distribution could be a 
good approximation; however, everyday experience tells 
us that population density fluctuations can be quite 
large. Indeed for New York City, 5 ~ 300. Hence, 
a theory of human population density growth must 
necessarily be a non-linear. 


Appendix B: Derivation of the inverse-rank 
friendship law 

As a second application of our formalism, let us derive 
the average number of friends a person has in a given 
region. We again adopt a simple model, where we de¬ 
fine a region to be a community if the population density 
exceeds some critical value S > S c . This defines geo¬ 
graphic equivalence classes on the inhabited regions, such 
that every person is a member of a community. Since 
real-world social networks are highly clustered and only 
a small fraction of people serve as connections between 
communities of friends )20j . this assumption should be 
a good approximation for our purposes, since the more 
complicated topology of real-world friendship networks 
will mainly affect higher order quantities that involve 
friends-of-friends and friends-of-friends-of-friends. Fur¬ 
thermore, we assume that the average number of friends 
D a given person has is asymptotically independent of 
the size of the community. This second assumption is 
essentially the assertion of the existence of the famous 
Dunbar’s number 1211122j . an upper limit on the number 
of people a given person can sustain social relationships 
with. 

To compute the probability in the model, we consider 
two people A and B with Nab people closer to A than B. 
If A is a member of a community with size N c Nab , A 
and B are almost certainly friends. On the other hand, if 
A is in a community of size N c -C Nab, it will be nearly 
impossible for A and B to be friends. There is thus a 
turnover scale at ~ Nab which dictates whether or not 
A and B will be friends; the probability is therefore deter¬ 
mined by two independent events: the probability that 
A is in a community of size N greater than the turnover 
scale and the probability pp = D/N that A and B are 
friends given that A and B are in a community of size 
N, for large N D. Since we know from the previ¬ 
ous discussions that in such a model, the number density 


of communities scales asymptotically with oc 1/N 2 , and 
each community has N people, the probability p c that a 
randomly chosen individual is in a community of size N 
scales ~ 1/N. Hence, 

p(N A b) = f g(N,N A B)Pc(N)p f (N)dN 

! 1 D 1 ,B1 * 

NN dNlX Nl~ B 

where g has the properties that 0 < g < 1, g ~ 1 for 
N Nab, and g <C 0 for N -C Nab- The details of the 
function will depend on the geometry of the communities 
but do not concern us here as we are only interested 
in the scaling. We have thus derived the inverse-rank 
friendship law, previously proposed [23] to fit empirical 
data. We stress that our derivation is based entirely 
on theoretical considerations and therefore provides an 
explanation for the “physical” origin of the law. 


Appendix C: Two-point correlation function 

In this appendix, we analytically compute the two- 
point correlation function [23! £(x — y) = (S(x)S(y)), 
which is the inverse Fourier transform of the power spec¬ 
trum. The correlation function will play an important 
role in Appendix D. Physically, the correlation function 
measures the degree to which the existence of an over¬ 
density or under-density at some position x increases the 
likelihood that an over-density or under-density will be 
found at y. Assuming circular symmetry, the inverse 
Fourier transform is a Hankel transform of order 0: 

r°° U rlh- 

ar)= J 0 ~2^ P{k)J ° {kr) ' (C1) 

where Jo is the first Bessel function. Taking P(k) = 
Pok~ 2 and a long-wavelength cutoff k m gives us an inte¬ 
gral that can be written in terms of special functions 

«*•'/>«■ (C2) 

where G is the Meijer G function. Defining a reduced 
area a = k^r 2 /2 and consider separations that are small 
compared to the system size (corresponding to the scale 
of continents) a€l, we can expand 

« r ) sa g(- 7+ i(_l„ 0 + a _|. + A_])) (C3) 

where 7 « 5.7721 is Euler’s constant and we only neglect 
terms 0((k m r) 8 ). The second term guarantees that £ 

1 for sufficiently small a and £ < 0 for a > 0.51. Most 
importantly, we note that for r much smaller than the 
system size, 

Z(r) -A -^lnr. (C4) 
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Since it is possible to invert a Fourier transform, any 2D 
model which predicts a correlation function that logarith¬ 
mically diverges for small r must have a power spectrum 
of the form P oc k~ 2 3 4 5 6 7 8 9 10 11 for k k rn . 


Appendix D: Effective field theory 


The basic program of effective field theory is the fol¬ 
lowing: given a statistical physics system in a fixed num¬ 
ber of dimensions (in this case D = 2), write down the 
Hamiltonian 


H = 


d 2 xTL(6, VS, VVS,.. 


(Dl) 


such that TL contains all terms which are consistent 
with the symmetries of the system. By universality, the 
macroscopic properties of the system should then be re¬ 
flected in the field theory. For a pedagogical introduction 
to this approach, see m- In our case, the symmetries are 
particularly constraining: we want the Hamiltonian to be 
invariant under scaling operations x —> Xx in addition to 
translations and rotations of the Euclidean plane. Un¬ 
der a change of scale, the population density transforms 
like a scalar, so S(x) —> S(X~ 1 x) while d 2 x -A A 2 d 2 x and 
VS -A A _1 V<5. Hence scale invariance requires that each 
term in TL contain exactly two derivatives to cancel the A 2 
from the area element. Rotational symmetry then limits 
us to only one possible term (VS) 2 : 

H= l -Jd 2 x(V5) 2 , (D2) 


which is simply a free scalar field in two dimensions. 
Adding any interaction term to TL of the form V (<5) is not 
allowed, as d 2 x V(5) would not transform correctly un¬ 
der a scale. Using standard field theory techniques, one 
can show that the correlation function £(r) has the form 
of (C4); hence for wave vectors k 3> k m (corresponding 
to physical scales shorter than the system size), we must 
have that the power spectrum P oc k~ 2 . This concludes 
our proof that a scalar random held in 2 spatial dimen¬ 
sions will have a power spectrum P oc fc~ 2 . 

Let us make some further comments about (D2) that 
may be helpful to readers unfamiliar with effective held 
theory techniques. In particular, we can use (D2) to con¬ 
struct alternate theories that also will yield power spectra 
P oc k~ 2 . For example, the generic Langevin equation, 
specialized to the case where the Hamiltonian is given by 
(D2), is iust the famous diffusion equation with a noise 
term[T2]: 


— = -AV 2 <5 + 77 , (D3) 

where S is the fractional population over-density and 
is a fluctuating random variable. Notice that while 
this model also involves a diffusion equation, it is con¬ 
ceptually distinct from equation (2). Here we think of 
population as physically diffusing in Cartesian space; in 
equation (2), the diffusion is not happening in Carte¬ 
sian space but in Fourier space. Since (D3) was derived 
from a Langevin equation corresponding to the Hamilto¬ 
nian (D2), its equilibrium properties must be described 
by (D2); hence it follows that P(k) oc k~ 2 . Note, how¬ 
ever, that the noisy diffusion model lacks parameters di¬ 
mensions of length to any power. Thus, our simple di¬ 
mensional analysis argument presented after equation (3) 
still holds, and provides a simpler derivation of the power 
spectrum. 
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